Supporting Undo and Redo in Scientific Data Analysis
نویسندگان
چکیده
This paper presents a provenance-based technique to support undoing and redoing of data analysis tasks. The technique targets scientists who experiment with combinations of approaches to processing raw data into presentable datasets. Raw data may be noisy and in need of cleaning, it may suffer from sensor drift that requires retrospective calibration and data correction, or it may need gap-filling due to sensor malfunction or environmental conditions. Different raw datasets may have different issues requiring different kinds of adjustments, and each issue may potentially be handled by different approaches. Thus, scientists must often experiment with different sequences of approaches. In our work, we show how provenance information can be used to facilitate this kind of experimentation with scientific datasets. We describe an approach that supports the ability to (1) undo a set of tasks while setting aside the artifacts and consequences of performing those tasks, (2) replace, remove, or add a data-processing technique, and (3) redo automatically those set aside tasks that are consistent with changed technique. We have implemented our technique and demonstrate its utility with a case study of a common, sensor-network, data-processing scenario showing how our approach can reduce the cost of changing intermediate data-processing techniques in a complex, dataintensive process.
منابع مشابه
A General Multi-User Undo/Redo Model
A general multi-user undo/redo model must satisfy several requirements It must be compatible with an existing single-user undo/redo model, give individual users autonomy in executing undo/redo commands, support undo/redo of remote commands and the remote effects of local commands, be independent of the coupling, multicast, and concurrency control model, and allow undo/redo of arbitrary commands...
متن کاملRelaxing Persistent Memory Constraints with Hardware-Driven Undo+Redo Logging
Persistent memory is a new tier of memory that functions as a hybrid of traditional storage systems and main memory. It combines the benefits of both: the data persistence property of storage with the fast load/store interface of memory. Yet, efficiently supporting data persistence in memory requires non-trivial effort. In particular, logging is a widely used data persistence scheme due to its ...
متن کاملPerformance of Recovery Algorithms for Centralized Database Management Systems
Database Recovery is responsible for preserving the database consistency after a failure of any kind (transaction, system or media). Relevant information solely for recovery is saved in a log during normal transaction processing. To recover from a failure, basically two operations: undo and redo are applied with the help of the log on the last consistent state of the database. These two operati...
متن کاملMulti-user Undo/redo
We have developed a multiuser undo/redo model by extending an existing single-user undo/redo model. The model, consisting of a semantic model and an implementation model, is applicable to general multiuser programs including programs ooering both WYSIWIS and WYSINWIS interaction, oor control and concurrent interaction, and atomic and non-atomic broadcast. The semantic model constructs the comma...
متن کاملReflections on Undo
Undo is seen as essential element of interactive systems. However, despite its prevalence users are often confused by its behaviour and developers often apply undo inconsistently within their systems. This report focuses on singleuser linear undo/redo systems It presents an abstract formal framework for modelling undo and related system extensions, a classification and taxonomy of undo and redo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013